Hyunwoo J. Kim

Latent Bayesian Optimization via Autoregressive Normalizing Flows

Apr 21, 2025

ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models

Mar 26, 2025

Super-class guided Transformer for Zero-Shot Attribute Classification

Jan 16, 2025

VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning

Jan 12, 2025

EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality

Nov 22, 2024

Inversion-based Latent Bayesian Optimization

Nov 08, 2024

Constant Acceleration Flow

Nov 01, 2024

LLaMo: Large Language Model-based Molecular Graph Assistant

Oct 31, 2024

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Oct 22, 2024

Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation

Oct 12, 2024